Statistical Machine Translation Dayak Language – Indonesia Language

نویسندگان

چکیده

This Paper aims to discuss how create the local language machine translation of Indonesia Language where reason selection was carried out as considering using translator for are still infrequently found mainly Dayak translator. Machine Translation on this research had used statistical approach resource data that taken originated from articles dayaknews.com pages with total parallel corpus approximately 1000 – furthermore contains sentences accordingly divided into three sections in order comprehend certain analysis a pattern created. The monolingual collected Language. testing Bilingual Evaluation Understudy (BLEU) tool and result highest accuracy value amounting 49.15% which increase some others 3%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Machine Translation with Local Language Models

Part-of-speech language modeling is commonly used as a component in statistical machine translation systems, but there is mixed evidence that its usage leads to significant improvements. We argue that its limited effectiveness is due to the lack of lexicalization. We introduce a new approach that builds a separate local language model for each word and part-of-speech pair. The resulting models ...

متن کامل

Randomised Language Modelling for Statistical Machine Translation

A Bloom filter (BF) is a randomised data structure for set membership queries. Its space requirements are significantly below lossless information-theoretic lower bounds but it produces false positives with some quantifiable probability. Here we explore the use of BFs for language modelling in statistical machine translation. We show how a BF containing n-grams can enable us to use much larger ...

متن کامل

Natural language understanding using statistical machine translation

Over the past years, automatic dialogue systems and telephonebased machine inquiry systems have received increasing attention. In addition to an automatic speech recognizer and a dialogue manager, such systems consist of a natural language understanding (NLU) component. Some of the most investigated approaches to NLU are rule-based methods as Stochastic Grammars, which are often written manuall...

متن کامل

Improved Language Modeling for Statistical Machine Translation

Statistical machine translation systems use a combination of one or more translation models and a language model. While there is a significant body of research addressing the improvement of translation models, the problem of optimizing language models for a specific translation task has not received much attention. Typically, standard word trigram models are used as an out-of-the-box component ...

متن کامل

Statistical Natural Language Genera Machine Translation

This paper presents a statistical natural language generation scheme for trainable speech-to-speech machine translation (MT) systems. The natural language generation scheme in the translation systems is based on a maximum entropy (ME) statistical model fully trained from a corpus, allowing flexible translation outputs. In this paper, the system architecture and some of its components, including...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Informatika Mulawarman

سال: 2021

ISSN: ['1858-4853', '2597-4963']

DOI: https://doi.org/10.30872/jim.v16i1.5315